The statistical analysis of multivariate serological frequency data.
نویسنده
چکیده
Data occurring in the form of frequencies are common in genetics-for example, in serology. Examples are provided by the AB0 group, the Rhesus group, and also DNA data. The statistical analysis of tables of frequencies is carried out using the available methods of multivariate analysis with usually three principal aims. One of these is to seek meaningful relationships between the components of a data set, the second is to examine relationships between populations from which the data have been obtained, the third is to bring about a reduction in dimensionality. This latter aim is usually realized by means of bivariate scatter diagrams using scores computed from a multivariate analysis. The multivariate statistical analysis of tables of frequencies cannot safely be carried out by standard multivariate procedures because they represent compositions and are therefore embedded in simplex space, a subspace of full space. Appropriate procedures for simplex space are compared and contrasted with simple standard methods of multivariate analysis ("raw" principal component analysis). The study shows that the differences between a log-ratio model and a simple logarithmic transformation of proportions may not be very great, particularly as regards graphical ordinations, but important discrepancies do occur. The divergencies between logarithmically based analyses and raw data are, however, great. Published data on Rhesus alleles observed for Italian populations are used to exemplify the subject.
منابع مشابه
Analysis of physiochemical and microbial quality of waters of the Karkheh River in southwestern Iran using multivariate statistical methods
Rapid population growth as well as agricultural and industrial development have increased the contamination of Iranian rivers. This study utilized principal components analysis (PCA) to determine the degree of significance of qualitative parameters of water resources in the Karkheh River in southwestern Iran. Cluster analysis (CA) grouped the monitoring stations based on the water quality data ...
متن کاملMultivariate statistical analyzing of chemical parameters of thermal and non-thermal springs of Mahalat area in Iran
In this study multivariate statistical analysis are used to characterize relationships between hydrochemical properties ofthermal and non-thermal springs. Four factors for thermal waters and two for non-thermal springs were extracted basedon factor analysis. In thermal springs, the first factor showed high loading on Ca, Mg, Na and K and this factor wasinterpreted as leaching of cations in the ...
متن کاملMultivariate Statistical Analysis Decision-making Hybrid Method for Road Traffic Safety Evaluation in Iran
Obviously, improving the road safety and the efficient allocation of limited resources to the provinces according to their ranking should be done. This paper presents a hybrid method of multivariate statistical analysis-decision making to evaluate Iran road traffic safety. In order to solve the problems of road traffic safety, a macroscopic evaluation and traffic safety level classification in ...
متن کاملRelationship between Yield and its Component in Soybean Genotypes (Glycine Max L.) using Multivariate Statistical Methods
18 soybean genotypes were examined to investigate the relationships between some principal attributions of morphology with seed yield per soybean, by Random Complete Block Design (RCBD) study. This study was also carried out three replicates to gain reliable results. The results of variance analysis indicated that, there were significance differences among all soybean genotypes. Moreover, the r...
متن کاملA Clustering Based Location-allocation Problem Considering Transportation Costs and Statistical Properties (RESEARCH NOTE)
Cluster analysis is a useful technique in multivariate statistical analysis. Different types of hierarchical cluster analysis and K-means have been used for data analysis in previous studies. However, the K-means algorithm can be improved using some metaheuristics algorithms. In this study, we propose simulated annealing based algorithm for K-means in the clustering analysis which we refer it a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bulletin of mathematical biology
دوره 67 6 شماره
صفحات -
تاریخ انتشار 2005